Reducing Redundancy in Keyword Query Processing on Graph Databases

نویسنده

  • Chang-Sup Park
چکیده

In this paper, we propose a new approach to reducing redundancy in the answers to a keyword query over large graph databases. Aiming to generate query results which are not only relevant but also has diverse structures and content nodes, we propose a method to find top-k answer sub-trees which should be in reduced forms and duplication-free in regard to the set of content nodes. To process keyword queries efficiently over large graph data, we suggest an efficient indexing scheme on the most relevant paths from nodes to keyword terms in the graph. We present a top-k query processing algorithm which exploits the pre-constructed indexes to search for a set of most relevant and non-redundant answers. We also provide a state space search algorithm to find most relevant duplication-free answers in an efficient way. We show effectiveness and efficiency of the proposed approach in comparison with the previous methods using extensive experiments on real graph datasets.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Effective Path-aware Approach for Keyword Search over Data Graphs

Abstract—Keyword Search is known as a user-friendly alternative for structured languages to retrieve information from graph-structured data. Efficient retrieving of relevant answers to a keyword query and effective ranking of these answers according to their relevance are two main challenges in the keyword search over graph-structured data. In this paper, a novel scoring function is proposed, w...

متن کامل

Towards a new Foundation for Keyword Search in Relational Databases

The idea of querying relational databases using keywords emerged a decade ago [4] as a way to provide an high-level access to data and free the user from the knowledge of query languages and data organization. The common approach to this problem is as follows: the database is viewed as a graph G in which the nodes represent tuples and the edges represent foreign key references between them, a q...

متن کامل

Diversified Top-k Keyword Query Interpretation on Knowledge Graphs

Exploring a knowledge graph through keyword queries to discover meaningful patterns has been studied in many scenarios recently. From the perspective of query understanding, it aims to find a number of specific interpretations for ambiguous keyword queries. With the assistance of interpretation, the users can actively reduce the search space and get more relevant results. In this paper, we prop...

متن کامل

Keyword Proximity Search on XML Graphs

XKeyword provides efficient keyword proximity queries on large XML graph databases. A query is simply a list of keywords and does not require any schema or query language knowledge for its formulation. XKeyword is built on a relational database and, hence, can accommodate very large graphs. Query evaluation is optimized by using the graph’s schema. In particular, XKeyword consists of two stages...

متن کامل

Keyword search across distributed heterogenous structured data sources

Many applications and users require integrated data from multiple, distributed, heterogeneous (semi-) structured sources. Sources are relational databases, XML databases, or even structured Web resources. Mediator systems represent one class of solutions for data integration. They provide a uniform view and uniform way to query the virtually integrated data. As data resides in the local sources...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • J. Inf. Sci. Eng.

دوره 34  شماره 

صفحات  -

تاریخ انتشار 2018